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Abstract 

The accuracy of a measurement of the spin direction of a spin-s par- 
ticle is characterised, for arbitrary half- integral s. The disturbance 
caused by the measurement is also characterised. The approach is 
based on that taken in several previous papers concerning joint mea- 
surements of position and momentum. As in those papers, a distinc- 
tion is made between the errors of retrodiction and prediction. Retro- 
dictive and predictive error relationships are derived. The POVM de- 
scribing the outcome of a maximally accurate measurement process 
is investigated. It is shown that, if the measurement is retrodictively 
optimal, then the distribution of measured values is given by the 
initial state SU(2) Q-function. If the measurement is predictively op- 
timal, then the distribution of measured values is related to the final 
state SU(2) P-function. The general form of the unitary evolution 
operator producing an optimal measurement is characterised. 
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1. Introduction 

In a recent series of papers |], g, H, [|, |^] we analysed the concept of experimental 
accuracy, as it applies to simultaneous measurements of position and momentum 
|?|, |], ||, |l0) . The purpose of this paper is to give a similar analysis for measurements 
of spin direction. 

There have been a number of previous discussions of joint, imperfectly accurate 
measurements of two (non-commuting) components of spin |0J. Measurements 
of spin direction — the kind of measurement considered in this paper — have been 
discussed by Busch and Schroeck Q , Grabowski [|l3) , Peres , and Busch et al || . 
In the following we extend the work of these authors by giving an analysis of the 
measurement errors, and of the conditions for a measurement process to be optimal. 
In particular, we will show that a measurement is retrodictively optimal if and only 
if the distribution of measured values is given by the generalized Q-function which 



is defined in terms of SU(2) coherent states [14| [JJJ |l6|, [17| (corresponding to an 
analogous property of joint measurements of position and momentum derived by Ali 
and Prugovecki [[18), and proved under less restrictive conditions in Appleby Q). 

This result provides us with some further insight into the physical significance of 
the SU(2) Q-function. It also has a bearing on the problem of state reconstruction. 
Amiet and Weigert jl^, ||(| have recently shown how, by making measurements 
of a single spin component for sufficiently many differently oriented Stern-Gerlach 
apparatuses, one can calculate the corresponding values of the SU(2) Q-function, 
and thereby reconstruct the density matrix. The fact that a retrodictively optimal 
measurement of spin direction has the Q-function as its distribution of measured 
values suggests an alternative approach to the problem of state reconstruction: for it 
means that one can reconstruct the density matrix from the statistics of a single run 
of measurements, performed on a single apparatus. The fact that measurements 
whose outcome is described the Q-function have this property of informational 
completeness has been stressed by Busch and Schroeck jl^] (also see Busch [ gj] , 
Busch et al g and Schroeck @). 

Retrodictively optimal joint measurements of position and momentum [Q, |l8| give 
rise to the ordinary Husimi or Q-function p2|, p3l [2^ | , and so they also have the 



property of informational completeness g, |, 21 , 2f|, at least in principle. However, 
the practical usefulness of this fact is somewhat restricted, due to the amplification 
of statistical errors which occurs when one attempts to perform the reconstruction 
starting from real experimental data jlO|, |26fl . No such difficulty arises in the case 
of measurements of spin direction, due to the fact that the state space is finite 
dimensional. 

We now outline the approach taken in the remainder of this paper. We consider 
a system consisting of a single spin, with angular momentum operator S satisfying 
the usual commutation relations [S^Sj,] = '^=i f « k ^ (with units chosen such 
that h = 1). We take it that S 2 = s(s + 1) for some arbitrary, but fixed half-integer 
s. 

The components of S are non-commuting, so they cannot all be simultaneously 
measured with perfect precision. However, they can all be measured with a less 
than perfect degree of accuracy. In order to do so one can use the same kind of 
procedure which is employed in the Arthurs-Kelly process j|, ^, 0, ||, ||, [l0|: that is, 
one can couple the non-commuting observables of interest — the components of S — 
to another set of "pointer" or "meter" observables which do commute, and whose 
values may therefore be simultaneously determined with arbitrary precision. 

The question we then have to decide is how to choose the pointer observables. 
The observables to be measured satisfy the constraint S 2 = s(s + 1), where s 
is fixed. Consequently, one might take the view that the magnitude of the spin 
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vector is already known, and that all that needs to be measured is its direction. 
This suggests that the pointer observables should be taken to be the (commuting) 
components of a unit vector n, satisfying the constraint n 2 — 1 . The direction of n 
measures the direction of S. We will refer to this as a type 1 measurement. Such 
measurements are discussed in Sections 

There is another possibility: for one could take the pointer observables to be the 
three independent, commuting components of a vector ft, no constraint being placed 
on the squared modulus pr. The value of Si (respectively S2, S3) is measured by 
/ti (respectively fe, /is). We will refer to this as a type 2 measurement. Such 
measurements are discussed in Section |^. 

We begin our analysis in Section ||, by characterising the POVM (positive oper- 
ator valued measure) describing the outcome of an arbitrary type 1 measurement 
process. 

In Section ^ we characterise the accuracy of and disturbance caused by a type 1 
measurement process. Our definitions are based on those given in Appleby Jl, 
for simultaneous measurements of position and momentum. In particular, we are 
led to make a distinction between two different kinds of accuracy, which we refer 
to as retrodictive and predictive. 

After giving, in Section |], a brief summary of the relevant features of the theory 
of SU(2) coherent states we go on, in Section ||, to describe retrodictively optimal 
type 1 measurements. We establish a bound on the retrodictive accuracy. We define 
a retrodictively optimal measurement to be a measurement which (1) achieves the 
maximum possible degree of retrodictive accuracy, and which (2) is isotropic (in a 
sense to be explained). We then show that the necessary and sufficient condition for 
the measurement to be retrodictively optimal is that the distribution of measured 
values be given by the initial state SU(2) Q-function. 

In Section |^ we establish a bound on the predictive accuracy of a type 1 mea- 
surement. We derive a necessary and sufficient condition for this bound to be 
achieved, in which case we say that the measurement is predictively optimal. We 
show that the distribution of measured values is then related to the final state SU(2) 
P-function. 

In Section ^ we consider completely optimal type 1 measurement processes — i.e. 
processes that are both retrodictively and predictively optimal. We give the general 
form of the unitary evolution operator describing such a process. 

Finally, in Section ||, we consider type 2 measurements. We define the retrod- 
ictive and predictive errors of such measurements, and establish bounds which the 
errors must satisfy. We then show that, in the limit as a type 2 measurement tends 
to optimality (retrodictive or predictive), it more and more nearly approaches an 
optimal type 1 measurement (with the replacement s _1 /i — > n). It follows that, in 
so far as the aim is to maximise the measurement accuracy, type 2 measurements 
have no advantages. 



2. Type 1 Measurements: POVM 

The purpose of this section is to characterise the POVM (positive operator valued 
measure) [fl |L ^ ^7], describing the outcome of an arbitrary type 1 measurement. 

We take a type 1 measurement to consist of a process in which the system, with 
2s + 1 dimensional state space !K sy , is coupled to a measuring apparatus, with state 
space 3"C a p- The interaction commences at a time t — ti when system+apparatus 
are in the product state \ip <E> Xa P ), where \ip) € JC sy is the initial state of the system 
and |xap) S 5C ap is the initial state of the apparatus. It ends after a finite time 
interval at t = if when system+apparatus are in the state U \ip ® Yap), where U is 
the unitary evolution operator describing the measurement interaction. 
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It should be stressed that this description is quite general. In particular, we are 
not making an impulsive approximation. Nor are we assuming that the interaction 
Hamiltonian is large in comparison with the Hamiltonians describing the system 
and apparatus separately. The only substantive assumption is the statement that 
system+apparatus are initially in a product state (so that they are initially uncor- 
rected). 

It should be noted that \ip) is arbitrary, since the system might initially be in 
any state £ !H sy . On the other hand |Xap) is fixed, since we assume that initially 
the apparatus is always in the same "zeroed" or "ready" state. 

As explained in Section [l], we take it that the result of the measurement is speci- 
fied by the recorded values of three commuting pointer observables n = (fa, fi2, n.3), 
satisfying the constraint J2r=i ™r = 1 ( so that there are only two pointer degrees of 
freedom). However, a measuring instrument does not usually consist of some point- 
ers, and nothing else. We therefore allow for the existence of N additional apparatus 
observables £ = (£1,.. . ,£n) which, together with the components of n, constitute 
a complete commuting set. The eigenkets |n, £) thus provide an orthonormal basis 
for M a p. 

The operator U specifies the final state of system+apparatus given any initial 
state € !K sy (& 3"C ap . However, we are only interested in initial states of the very 
special form \ip (8 Xap) , where |x a p) is fixed. In other words, the operator U provides 
us with much more information than we actually need. It turns out that all the 
quantities which are relevant to the argument of this paper can be expressed in 
terms of the operator T(n, £), defined by g, ||, |27[ |lj 

f(n,0= (<™il®M)#(K>®IXap>)|mi)<m 2 | (1) 

mi ,ni2--s 

where |m) denotes the eigenket of S3 with eigenvalue m (in units such that H = 1). 
The operator T(n,£) is more convenient to work with because, unlike U, it only 
acts on the system state space J{ sy . 

The significance of the operator T(n, £) is that it describes the change in the 
state of the system which is caused by the measurement process |9|, |27], |2^] (i.e. 
it describes the operation || || |27| induced by the measurement). In fact, suppose 
that the measurement is non-selective (meaning that the final value of n is not 
recorded, so that there is no "collapse"), and let pf be the reduced density matrix 
describing the final state of the system. It is then readily verified that 

(k = J dn^f(n,0 |V) (^1 ft(n,0 (2) 

where dn denotes the usual measure on the unit 2-sphcrc: in terms of spherical 
polars dn = sin 9ddd(j>. 

Let pvai( n ) be the probability density function describing the distribution of 
measured values: 

p val (n)=^ Jd€\((m\® (n,e|)L>(|V)®| X ap))| 2 (3) 

771 — — / 

(n) can also be expressed in terms of the operators T(n,£). In fact, define m, 



E(n) = J ^ft(n,£)T(n,0 (4) 

Then 

p val (n) = (^|^(n)|V) (5) 
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We see from this that E(n)dn is the POVM describing the measurement outcome. 
In particular 

E(n) > 

for all n and 



dnE(n) = 1 (6) 

Until now we have been assuming that the system is initially in a pure state. If 
the system is initially in the mixed state with density matrix p\ we have, in place 
ofEqs. (@) and (§, 

pi = j dn^T(n,e)ftTt(n,C) (7) 



and 

Pval 



(n) = Tr(^(n)A) (8) 

Eq. ([?]) gives the final state reduced density for the system in the case when the 
measurement is non-selective, so that the pointer position is not recorded. Suppose, 
on the other hand, that the final pointer position is recorded to be in the subset 3i 
of the unit 2-sphere. Then /5f is given by 

- [ dn [ ^T(n,£)/Sift(n,0 ( 9 ) 
where is the probability of finding n £ 31: 



Pi 

PR Jy. 



pji= dnp vai (n) 



3. Type 1 Measurements: Accuracy and Disturbance 

The purpose of this paper is to establish the form of the operators T(n, £) and 
E(n) when the measurement is optimal. In order to give a precise definition of what 
"optimal" means in this context, we first need to define a concept of measurement 
accuracy; which is the problem addressed in this section. We also discuss how to 
quantify the degree to which the system is disturbed by the measurement process. 

The approach we take is based on the approach taken in Appleby pi [|, to 
the problem of defining the accuracy of and disturbance caused by a simultaneous 
measurement of position and momentum. We thus work in terms of the Heisenberg 
picture. 

Let Si = S and n; = n be the initial values of the Heisenberg spin and pointer 
observables at the time t- u when the measurement interaction begins; and let Sf = 
U^SU and nf = U^hU be the final values of these observables at the time if, when 
the measurement interaction ends. Let § sy C IK sy be the unit sphere in the system 
state space. We then define the retrodictive fidelity i~n by 

Vi= inf ((^^Xapl |(n f -Si + Si-nf) |^®Xa P >) (10) 

and the predictive fidelity r/f by 

m= inl ( ® Xapl ■ Sf + S f • n f ) \ip <g> x ap ) ) 

(<V®Xap|nf 'Sf |V®Xap>) (11) 



= inf 

(where we have used the fact that the components of fif and Sf commute). It should 
be noted that the concept of fidelity employed here is somewhat different from the 
concept of fidelity which is employed in discussions of cloning and state estimation 
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(rji and r\i arc defined in terms of scalar products of observables, rather than scalar 
products of states). 

We also define the quantity r\& by 

Vd= inf f(^8>Xa P ||(SfSi + Si-Sf) |VOXap>) (12) 

The intuitive basis for these definitions is most easily appreciated if one thinks, 
temporarily, in classical terms. If interpreted classically 771 would represent the 
minimum expected degree of alignment between the final pointer direction and the 
initial direction of the spin vector. In other words, it would quantify the retrod- 
ictive accuracy of the measurement. On the other hand, rj[ would represent the 
minimum expected degree of alignment between the final pointer direction and the 
final direction of the spin vector: it would therefore provide a quantitative indica- 
tion of the predictive accuracy. Lastly, ?7d would quantify the extent to which the 
measurement disturbs the system, by changing the direction of the spin vector. 

Of course, fif, Si, Sf are in fact quantum mechanical observables, and so the 
physical interpretation of 771, rji and rjd needs to be justified much more carefully. 
Rather than proceeding directly, it will be convenient first to relate these quantities 
to an alternative characterisation of the measurement accuracy and disturbance. 
This will allow us to appeal to the arguments given in Appleby |l|, [|, to justify our 
earlier characterisation of the accuracy of and disturbance caused by a simultaneous 
measurement of position and momentum. It will also be helpful in Section |^, when 
we compare type 1 and type 2 measurements. 

In a type 1 measurement, the result of the measurement is a direction, repre- 
sented by the unit vector n. However, one could extract from this information 
estimates of the initial and final values of the spin vector itself by multiplying n 
by suitable constants: say as an estimate for S;, and Qn as an estimate for Sf. 
The question then arises: what are the best choices for these constants? 

To answer this question, consider the quantities 

sup ((4> <g) Xa P | lOnf - Sil 2 lip O Xap) ) = Cf - 2Ci?7i + s ( s + !) 

IV>>6S sy V ' 

and 

sup 

|V-)6S a , 

These expressions are minimised if we choose (i = r] u (f = r/f. We accordingly 
define the maximal rms error of retrodiction 

i_ 

A ci S=( sup ((V®Xa P | knf-Si| 2 |V®Xap}) I = (s + s 2 - rj?) * (13) 
and the maximal rms error of prediction 

A ef S=( sup ((V ® Xa P I hfn f - S f | 2 |V> «> Xap)) I - {s + s 2 -ii 2 Y (14) 
We also define the maximal rms disturbance by 

A d S=( sup f(V>®Xa P | |Sf-Si| 2 |V®Xa P >) ) = y/2 (s + s 2 - Vd y 2 (15) 
\IV)es sy v >) 

Comparing these expressions with those given in refs. Gl || it can be seen that A e iS 
plays the same role in relation to the kind of measurement here considered as do the 
quantities A C ja;, A ij» in relation to joint measurements of position and momentum; 
that ActiS is the analogue of A c fa;, A e fp; and that A^S is the analogue of Aja;, A^p. 



(<V> ® Xa P | ICffif - s f | 2 |v> «) xap)) = Cf - Km + s(s + 1) 
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A suitably modified version of the argument given in Section 5 of ref. |lj may then 
be used to show that A^S* (and therefore 771) describes the retrodictive accuracy of 
the measurement; that A e fS (and therefore r){) describes the predictive accuracy; 
and that A^S (and therefore ?7d) describes the degree of disturbance caused by the 
measurement. 

Finally, we note that the quantities 77;, r/i and 77^ can be expressed in terms of 
the operators T(n, and ^(n) defined earlier. In fact, comparing Eqs. (|l|) and ([|) 
with Eqs. (0-|l|) one finds 

?7i= inf f [ dnU\UE(n)n-S + n-SE(n)) \if>) ) (16) 
Wes sy \J / 

m = ^M^J dnde^|ft(n,On-§T'(n,0|^>) (17) 

and 

Vd = inf ( f dndt; V<7/-|i(ft(n,e)5aT(n,C)5 a 

+S a f\n,OS a f(n,0)\^)) (18) 



4. SU(2) Coherent States 

The task we now face is to establish upper bounds on the fidelities 7ft, ?7f (or, 
equivalently, lower bounds on the errors A C iS, A c fS), and then to establish the form 
of the operators T(n, £), E(n) for which these bounds are achieved. The theory of 
SU(2) coherent states will play an important role in the argument. In order to fix 
notation we begin by summarising the relevant parts of this theory. For proofs of 
the statements made in this section see refs. [Q |l5], |TJ|, [L7| . 

For each unit vector n £ R 3 choose a vector 8 n £ M 3 with the property 

exp[-i0 n • S] S3 exp[i8 n ■ S] = n ■ S 

Define 

|n,m) = exp[-i0 n • S] |m) (19) 
where |m) is the normalized eigenvector of £3 with eigenvalue m. We then have 

n ■ S |n, to) = m, |n, m) (20) 

and 

/dn In, m) (n, ml = 1 

for all to. 

We are especially interested in the states |n, s). These are the minimum un- 
certainty states, for which ^2 a =i(ASa) — s. To denote them we employ the 
abbreviated notation 

|n) = |n, S ) (21) 

The states |n) so defined £ 3{ sy and are eigenvectors of n • S. They need to be 
carefully distinguished from the states |n, £) which £ 3-C ap and are eigenvectors of 
fi. 

Let A be any operator acting on !H S y . The covariant symbol corresponding to A 
is defined by 

A cv (n) = (n|i|n) 
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The contravariant symbol corresponding to A is defined to be the unique function 

^ = J dnA cn (n) |n) (n| 



and which satisfies 

J dn'n 2s (n,n')A cn (n') = A cn (n) 
where Il2 S (n, n') is the projection kernel 

IT 2s (n, n') = f; £ y jm (n)r; m (n') = £ ^P,(n ■ n') (22) 

j=0 m=—j j=0 

In these expressions the Yj m are spherical harmonics and the Pj are Legendre 
polynomials. 

It can be shown that, given any square integrable function /, 

A = ^r J rfn /( n )i n )< n i ( 23 ) 

if and only if 



/ 



dn'n 2s (n,nO/(n') = A cn (n) (24) 



for almost all n. 

The covariant (respectively contravariant) symbol of an operator is often referred 
to as the Q (respectively P) symbol of that operator. However, we will find it more 
convenient to reserve this notation for the symbols corresponding specifically to the 
density matrix, scaled by a factor (2s + 1)/(Att): 

Q(n) = ^^Pcv(n) (25) 

P(n) = ^Pc„(n) (26) 

With this rescaling the Q and P-functions satisfy the normalisation condition 

J dnQ(n) = J dnP(n) = 1 

In particular, Q(n) is a probability density function. As we will see, it is in fact 
the probability density function describing the outcome of a retrodictively optimal 
type 1 measurement. 

5. Retrodictively Optimal Type 1 Measurements 

The purpose of this section is to investigate those processes which maximise the 
retrodictive fidelity. We begin by establishing the following bound on 77;: 

m < s (27) 



which, in view of Eq. (13), implies 

A ci S > (28) 



We will refer to Inequality ([28|) as the retrodictive error relation. It can be seen 
that it has the same form as the ordinary uncertainty relation, AS > \fs. It 
is the analogue, for the kind of measurement here considered, of the inequality 
A C {X A 01 p > 1/2 proved in ref . || for joint measurements of position and momentum 
(in units such that K = 1) . 



In order to prove this result we note that it follows from Eqs. ^ and ( |l6| ) that 
(2s + l) m < j dnd£ Tr(n • S ft(n, £)f (n, £)) 
In view of Eqs. (|j) and (||) we also have 

/" dndt Tr(ft(n,e)f(n,0) - (2s + 1) 

Consequently 

1 rfnrfe ■&((»» - n • S)f t(n, £)T(n, £)) < (29) 

For each fixed n the kets |n, m) defined by Eq. ([l9]) constitute an orthonormal basis. 
We may therefore write 

s 

f(n,0= J2 T mm ,(n,0\n,m)(n,m'\ (30) 

m,m' — ~s 



for suitable coefficients T mm t. Substituting this expression in Inequality (29) gives 
J2 ((»» - m ') / dnd ^ KOI 2 ) < (31) 

7n,m' = — s 

Inequality (|27| ) is now immediate. 

We next show that the retrodictive fidelity achieves its maximum value r/i = s if 
and only if E(n) is of the form 

E(n) = ^5(n) |n) (n| (32) 

for almost all n, where |n) is the state defined by Eq. (pl|), and where g is any 
function satisfying 

J dn'U 2s (n,n')g(n') = 1 (33) 



(34) 



[Il2 S (n, n') being the projection kernel defined by Eq. (|22| 
In fact, setting i~n = s in Inequality (|3l]) gives 



E ((*-"*') J dnd£|T mm ,(n,0| 2 ) <0 



m,m'— — s 



from which it follows that the coefficients T mm i must be of the form 

(2s + l\ ' 
T mm > (n, = ( 4?r J s m'i g m (n, 

for almost all n, £. Substituting this expression into Eq. (^) gives 

f(n,0=(^) 2 |.9(n,0}(n| (35) 
for almost all n, £, where 

s 

ls( n >0) = •9™( n ^) |n,m) 

m=— s 

Setting 

ff (n) = | ^||| 5 (n,e))|| 2 

and using Eq. (jij), we deduce that -E(n) is of the form specified by Eq. (|32|). It 
follows from Eqs. (^), (23) and (|24|), and the fact that id cn (n) = 1, that the function 
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g must satisfy Eq. (p3|). This proves that the condition represented by Eqs. (|32| ) 



and (33) is necessary. 

Suppose, on the other hand, that E(n) is given by Eq. (|32|), with g satisfying 
Eq. (§|). Using Eqs. @, (||) and @ we deduce 



^ = Pf ( 2S , + 1 / Ais<?(n)|(n| ?A) 

|^)es ay V 4tt J 



which shows that the condition is also sufficient. 

The condition rji = s is not, by itself, enough to determine the distribution 
of measured values. However, the requirement that the retrodictive fidelity be 
maximised is not the only property which it is natural to require of a measurement 
that is to count as optimal. It is also natural to require that the measurement 
does not pick out any distinguished spatial directions. We accordingly define an 
isotropic measurement to be one which has the property that, if the initial system 
state density matrix takes the rotationally invariant form 

then the distribution of measured values is also rotationally invariant: 

Pval(n) = — 

for all n. 

We define a retrodictivcly optimal type 1 measurement process to be an isotropic 
process for which the retrodictive fidelity is maximal, rji = s. It is then straightfor- 
ward to verify that a type 1 measurement process is retrodictively optimal if and 
only if E(n) = (2s + l)/(47r) |n) (n|. This is the POVM which has previously been 
discussed by Busch and Schroeck [Q, and others p|, p|, 

We see from Eq. (||) that the measurement is retrodictively optimal if and only 
if the distribution of measured values is given by 

p va i(n) = Qi(n) 

for all n, where Q\ is the Q-function corresponding to the initial system state density 
matrix: 

2s + 1 , , 
QiW = ( n l Pi l n ) 

In terms of the operator T(n, £), the necessary and sufficient condition for a type 
1 measurement to be retrodictively optimal is that [see Eq. (|35|)] 

f(n,0=(^) 2 |.9(n,0)(n| (36) 
where \g(n,^)) is any family of kets with the property 



/ 



dd| ff (n,£)}|| 2 = l (37) 



for all n. 

We conclude this section by showing that for retrodictively optimal type 1 mea- 
surements (Si) = (s + l)(Af). In fact 

(ip ® Xa P | n f \ip <g> Xa P ) = J dnn (ip\ E(n) \ip) 

= — — — / dn n I (ib I n) 1 2 
4tt J 

= 7Xy(^ *ap| Si |V> ® Xap) 
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where we have used the fact 15 that (s + l)n is the contravariant symbol corre- 
sponding to S. 

6. Predictively Optimal Type 1 Measurements 

The purpose of this section is to characterise the form of the operator T(n, £) 
and function p va i(n) for processes which maximise the predictive fidelity, r/{ . In the 
last section we showed that, for retrodictively optimal type 1 measurements, p V ai 
coincides with the initial system state Q-function. In this section we will show that 
if the measurement is predictively optimal, then p va i is related to the final system 
state P-function. 

We begin by establishing an upper bound on r]{. By a similar argument to the 
one leading to Inequality ( p9| ) we find 

J dnd£_ Tr((7/ f -n-S)f(n,C)T t (n,^)) < 

which only differs from Inequality ( p9| ) in the replacement of r]i by rjf , and in the fact 
that the order of T(n, £) and TT(n,£) is reversed. The analysis therefore proceeds 
in nearly the same way. Corresponding to Inequality (E7n wc deduce 



rjt < s (38) 



which, in view of Eq. (14), implies 

ActS" > Vs (39) 



We will refer to Inequality (39) as the predictive error relation. It is the analogue, 
for measurements of spin direction, of the inequality A fX A c tp > 1/2 proved in 
rcf. H for joint measurements of position and momentum (units chosen such that 
th = l). 

We define a predictively optimal type 1 measurement to be one for which the 
predictive fidelity is maximal, r/{ — s (unlike the case of retrodictive optimality, we 
do not impose the requirement that the measurement also be isotropic). By a similar 
argument to the one given in the last section we find, corresponding to Eqs. ( |36| ) 
and (|37|), that the necessary and sufficient condition for a type 1 measurement to 
be predictively optimal is that T(n, £) be of the form 

f(n,0=(^) 2 |n)</i(n,0l (40) 

for almost all n, £, where |/i(n,£)) is any family of kets satisfying the completeness 
relation 

?^Jdndt\h(n ) e)){h(n,Z)\ = l (41) 

If f (n, is of this form it follows from Eqs. (|) and (§) that 

Pval (n) = ^l [ d4(h(n,0\pi\h(n,0) (42) 

where p\ is the initial system density matrix. Now suppose that the measured value 
of n has been recorded to lie in the region 31 of the unit 2-sphere. Then, using 
Eqs. (|), © and @, we find 



Pi = — [ dnp va i(n) |n) (n| 
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where pji is the probability of recording the result n £ 31, and where pt is the final 
system reduced density matrix. In view of Eqs. (|2^), ( pij ) and (^6|) this means that 
the final system state P-function Pf is given by 



P f (n) = — / dn'n 2s (n,n')pvai(n') 



for almost all n, where IT2 S is the projection kernel defined by Eq. (|22|). 
If 31 is a sufficiently small region surrounding the point no then 

p { w |n > (no | 

Finally, we note that for a predictively optimal type 1 measurement (Sf) = s(nf). 
In fact 

<^Xap|S f |V^Xa P ) = J dad£(il>\'P(n,t)&f(n,S)\il>) 

= J dnde(n|S|n)(^|ft(n,0>(/»(n,0|V'> 

rfnn(^| P(n) l^i) 

= s(^(8Xap|nf |V>®Xap) 

where we have used the fact |l5) that sn is the covariant symbol corresponding to 
S. 

7. Completely Optimal Type 1 Measurements 
We define a completely optimal type 1 measurement to be one which is both 



retrodictively and predictively optimal. Referring to Eqs. fl3q), (|37|), (40), and (|4l| ) 
we see that the necessary and sufficient condition for this to be true is that T(n, £) 
be of the form 

f(n,0= (^) 2 /(n,0|n) (n| 
where / is any function with the property 

J d£ |/(n,a| 2 = l 

for all n. 

Expressed in terms of the operator U the condition reads [see Eq. (p])] 

i_ 

((mil ® (n,£\)U(\m 2 ) | Xap » = (^^) " /(n,0 (™i | n) (n| m 2 ) 

It is straightforward to verify that there do exist unitary operators U with this 
property. It follows that completely optimal measurements are defined mathemat- 
ically. The question as to whether they are possible physically is, of course, rather 
less straightforward. 

Referring to Eq. (|l^ ) we see that, for a completely optimal measurement, the 
quantity r/d, characterising the extent to which the system is disturbed by the 
measurement process, is given by 



(Is + 1 f s / , 

rid = inf — / dn - ( (ib n)(n n • S \w) + (tp n • S n)(n \ib) 

|V)eS sy \ Air J 2 V x 1 ' N 1 1 ' N 1 i / \ i / 



s 2 



(43) 
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where we have used the fact 15 that sn is the covariant symbol corresponding to 



S. In view of Eq. (|15|) it follows that 

A d S = V2s 

8. Type 2 Measurements 

In the preceding sections we have been concerned with type 1 measurements, 
for which the pointer position is constrained to lie on the unit 2— sphere. We now 
turn our attention to type 2 measurements. As explained in the Introduction, these 
are measurements for which the outcome is represented by the three independent 
commuting components of a vector ft, no constraint being placed on the squared 
modulus ft? — Y] ft? a . We will show that, the more nearly a type 2 measurement 
approaches to optimality, the more nearly it approximates an (optimal) type 1 
measurement. 

We first need to characterise the accuracy of a type 2 measurement. A similar 
analysis to that given in Section]^ can be carried through for type 2 measurements, 
with the replacement n — * fi. As before, we denote the additional apparatus degrees 
of freedom £ = (£i, . . . , £,n), so that the eigenkets £) comprise an orthonormal 
basis for the apparatus state space, Jf ap . Let |Xap) be the intial apparatus state, 
and let U be the unitary operator describing the evolution brought about by the 
measurement interaction. Then, if the initial system state is \ip), the final state of 
system+apparatus, immediately after the measurement interaction has ended, will 
be given by U \ip <S> Xap)- Corresponding to Eqs. (|l|) and (J4j> we define 

s 

£|)f/(|m') ® |Xap)) \m) (m'| 

m,m' — — s 

and 

E{ti) = J deft( M ^)f( M)€ ) (44) 

Corresponding to Eqs. ( |l3|) and ( |l4|) we define the maximal rms errors of retrodic- 
tion and prediction by 

i 

A ci S = sup ((^®Xa P | |Af-Si| 2 k®Xa P )) (45) 
\W>es B v ' I 



and 



Aef5=( SUp f(^® Xapl |Af - Sfl 2 |V> ®Xap)) ) (46) 

\iv->es ay v 

where S ; = S, S f = WSU and ft { = U^SU. It can be seen that Eq. (Eq) agrees 
with Eq. (13) if one replaces ft { — > ^fif, and that Eq. ( |46| ) agrees with Eq. ( pT] ) if 
one replaces /i f — > ^fAf . 

In terms of the operators E((i) and T(/i, £) we have 



and 



AciS* = SUp ( / d/J, V7?A| (Ma - £ a ) £(/x) (fi a - S a ) \lp) 

\\4>)es sy \J a=1 
A ei S=l sup ( fdnd£(il,\ft (»,0\ft-S\ 2 f | V)) 



(47) 



(48) 
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We next show that, corresponding to Inequality (p8|), one has the retrodictive 
error relationship for type 2 measurements 

A ei S > yfs (49) 

and that, corresponding to Inequality (^), one has the predictive error relationship 
for type 2 measurements 

A ef S > y/s (50) 



In fact, it follows from Eqs. (J4J) , Q47]) and (|48|) that 

(2.s + 1) (A ci S) 2 > Jdndt Tr(|/x-S| 2 ft(/x,0T(/x,0 

and 

(2 S + l)(A cf S) 2 > J ^Tr(|/x-S| 2 f(/x,C)Tt( M ,0) 
Using the fact 

(2s + 1) = J d/*d£ Tr (f t( M , f(ji, 0) 



we deduce 



1 d M de , &((|M-S| a -(AeiS) 2 )ft( A i ) 0T , (M,0) <0 

and 

1 d M dei>r(|/*-§r-(Aef5) 2 )T , (/i J e)T t (M»0) <0 (W) 



(51) 



Now make the expansion 

s 

f{ii, t)= J2 T ™»' (**> |n, m) (n, m'| 



m,m — — s 



where n = and |n, m) is the state defined by Eq. (|19[). Using this expansion 
Inequalities |3l]) and ( |52"| ) become 

I «((^-m') 2 + Gs 2 -™' 2 ) + (.s-(A ci 5) 2 ))|Tw(^O| 2 <0 

m,m' — — s 

(53) 

and 

J2 I dfidt ((^ ~ m) 2 + (s 2 - m 2 ) + (s - (A ci Sf))\T mm ,(»,Z)\ 2 <0 



/?; .//)'_ — ,s 



(54) 

Inequalities (|i^) and ( |50"| ) are now immediate. 
Setting AeiS* = y^s in Inequality ( |53| ) gives 

]T / d^((M-m') 2 + (s 2 -m /2 ))|T w (/i,e)| 2 <0 



nun'— — s 



which implies 

\T m m'(ii,£)\ = g m (n,£)5m's5((t - s) 

for suitable functions g m . However, this is not possible, since the square root of the 
(5-function is not defined. It follows that the lower bound set by Inequality ( J49| ) is 
not precisely achievable. Nor is the lower bound set by Inequality (J50|). 
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It is, however, possible to approach the lower bounds set by Inequalities (|i9|) 
and ([SO]) arbitrarily closely. It can be seen that as A C1 S — > y/s (respectively, A e f S — > 
•y/s), then T(/x, £) and £7(/Lt) become more and more strongly concentrated on the 
surface /i — s. In other words, the measurement more and more nearly approaches 
a type 1 measurement of maximal retrodictive (respectively, predictive) accuracy, 
with pointer observable n = jx/s. 

9. Conclusion 

There are a number of ways in which one might seek to develop the results 
reported in this paper. 

In the first place, although we showed that A^S = \J~2s~ for a completely optimal 
type 1 measurement, we did not derive error-disturbance relationships, analogous to 
the inequalities A C jX AdP, A C jpAdX, A c fX AdP, A c fp Ada; > 1/2 (in units such that 
H = 1) proved in ref. || for the case of a simultaneous measurement of position 
and momentum. The general principles of quantum mechanics (2^, [50| indicate 
that relationships of this kind must also hold for measurements of spin direction, 
at least on a qualitative level. However, it appears that the problem of giving 
the relationships precise, numerical expression is not entirely straightforward. The 
question requires further investigation. 

In this paper we have considered measurements of spin direction. However, the 
problem of simultaneously measuring just two components of spin is also impor- 
tant [O, |l2|]. It would be interesting to investigate the accuracy of measurements 
such as this, and to try to characterise the POVM (or POVM's, in the plural?) 
describing the outcome when the measurement is optimal. 

We have seen that SU(2) coherent states play an important role in the descrip- 
tion of optimal measurements of spin direction. In refs. |Q, [l8| it was shown that 
ordinary, Heisenberg-Weyl coherent states play an analogous role in the description 
of optimal joint measurements of position and momentum. It would be interesting 
to see if it is generally true, that every system of generalized coherent states is 
related in this way to joint measurements of the generators of the corresponding 
Lie group. 

There are some important questions of principle regarding measurements of a 
single spin component || |l2|, |3l], [52], It would be interesting to see if the 

approach to the problem of defining the measurement accuracy which was described 
in this paper can be used to gain some additional insight into these questions. 

Finally, it is obviously important to investigate whether optimal, or near optimal 
determinations of spin direction can be realised experimentally. 
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